Supervised Feature Ranking Using a Genetic Algorithm Optimized Artificial Neural Network
نویسندگان
چکیده
A genetic algorithm optimized artificial neural network GNW has been designed to rank features for two diversified multivariate data sets. The dimensions of these data sets are 85x24 and 62x25 for 24 or 25 molecular descriptors being computed for 85 matrix metalloproteinase-1 inhibitors or 62 hepatitis C virus NS3 protease inhibitors, respectively. Each molecular descriptor computed is treated as a feature and input into an input layer node of the artificial neural network. To optimize the artificial neural network by the genetic algorithm, each interconnected weight between input and hidden or between hidden and output layer nodes is binary encoded as a 16 bits string in a chromosome, and the chromosome is evolved by crossover and mutation operations. Each input layer node and its associated weights of the trained GNW are systematically omitted once (the self-depleted weights), and the corresponding weight adjustments due to the omission are computed to keep the overall network behavior unchanged. The primary feature ranking index defined as the sum of self-depleted weights and the corresponding weight adjustments computed is found capable of separating good from bad features for some artificial data sets of known feature rankings tested. The final feature indexes used to rank the data sets are computed as a sum of the weighted frequency of each feature being ranked in a particular rank for each data set being partitioned into numerous clusters. The two data sets are also clustered by a standard K-means method and trained by a support vector machine (SVM) for feature ranking using the computed F-scores as feature ranking index. It is found that GNW outperforms the SVM method on three artificial as well as the matrix metalloproteinase-1 inhibitor data sets studied. A clear-cut separation of good from bad features is offered by the GNW but not by the SVM method for a feature pool of known feature ranking.
منابع مشابه
Prediction of Cardiovascular Diseases Using an Optimized Artificial Neural Network
Introduction: It is of utmost importance to predict cardiovascular diseases correctly. Therefore, it is necessary to utilize those models with a minimum error rate and maximum reliability. This study aimed to combine an artificial neural network with the genetic algorithm to assess patients with myocardial infarction and congestive heart failure. Materials & Methods: This study utilized a m...
متن کاملPrediction of Surface Roughness by Hybrid Artificial Neural Network and Evolutionary Algorithms in End Milling
Machining processes such as end milling are the main steps of production which have major effect on the quality and cost of products. Surface roughness is one of the considerable factors that production managers tend to implement in their decisions. In this study, an artificial neural network is proposed to minimize the surface roughness by tuning the conditions of machining process such as cut...
متن کاملOptimization of Plastic Injection Molding Process by Combination of Artificial Neural Network and Genetic Algorithm
Injection molding is one of the most important and common plastic formation methods. Combination of modeling tools and optimization algorithms can be used in order to determine optimum process conditions for the injection molding of a special part. Because of the complication of the injection molding process and multiplicity of parameters and their interactive effects on one another, analytical...
متن کاملThe Predictability Power of Neural Network and Genetic Algorithm from Fiems’ Financial crisis
Organizations expose to financial risk that can lead to bankruptcy and loss of business is increased nowadays. This may leads to discontinuity in operations, increased legal fees, administrative costs and other indirect costs. Accordingly, the purpose of this study was to predict the financial crisis of Tehran Stock Exchange using neural network and genetic algorithm. This research is descripti...
متن کاملIdentifying Flow Units Using an Artificial Neural Network Approach Optimized by the Imperialist Competitive Algorithm
The spatial distribution of petrophysical properties within the reservoirs is one of the most important factors in reservoir characterization. Flow units are the continuous body over a specific reservoir volume within which the geological and petrophysical properties are the same. Accordingly, an accurate prediction of flow units is a major task to achieve a reliable petrophysical description o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Journal of chemical information and modeling
دوره 46 4 شماره
صفحات -
تاریخ انتشار 2006